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Abstract 

It is known that polynomial filtering can accelerate the convergence to¬ 
wards average consensus on an undirected network. In this paper the gain 
of a second-order filtering is investigated. A set of graphs is determined 
for which consensus can be attained in finite time, and a preconditioner 
is proposed to adapt the undirected weights of any given graph to achieve 
fastest convergence with the polynomial filter. The corresponding cost 
function differs from the traditional spectral gap, as it favors grouping 
the eigenvalues in two clusters. A possible loss of robustness of the poly¬ 
nomial filter is also highlighted. 


1 Consensus acceleration 

Since their introduction in [1], (discrete-time) consensus algorithms have at¬ 
tracted almost as much attention as their dual, fast mixing Markov chains mis]. 
Improving the convergence speed of this basic building block for e.g. distributed 
computation and sensor fusion has been a major focus. For synchronized fixed 
networks, one can optimize the weights on the links j5] , add local memory [1] , or 
introduce time-varying filters The present paper establishes the benefit 

of combining polynomial filtering with optimization of link weights. 

Consider an undirected and connected graph C/(V,£) with N nodes G V and 
M edges G £. Denote the node states as x = (xi ,X 2 , ■■■, xn) G ■ The basic 
linear consensus dynamics on Q is 

Xi{k+1) = Xi{k) +w'^L^j{xj{k)-Xi{k)) (1) 

with fc G N, ui G R some gain and L a symmetric matrix of edge weights called 
the Laplacian. We can rewrite Q as 

x{k+l) = {I — wL) x{k) = Pyj x{k ), 
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with I the identity matrix. Consensus is the stationary state satisfying 

Xi = X 2 = = Xn =: c or equivalently x = x 

(if the graph is connected); and average consensus requires the consensus value 
c to satisfy c = ^ J2f=i ^*(0)- The convergence speed of Q towards consensus 
is governed by Pu,’s second largest eigenvalue modulus (SLEM) 

fJ-{Pw) = max{|a;^Puja:| : x'^x < 1, Ylk=i^k = 0} 

or the “spectral gap” 1 — fi{Pw)- For a given graph, the fastest convergence 
is obtained by choosing the edge weights to maximize the spectral gap. This 
is a convex problem [5], whose solution we call the Fastest Single-Step Con¬ 
vergence network (FSSC)[^ In case of a fixed L, with knowledge of bounds 
A < A on its eigenvalues, the optimal u is obtained by choosing w such that 
/r = (l-u;A) = -(l-A). 


Accelerated consensus denotes the expansion of node and communication 
features, under the same graph constraint, to improve convergence speed. 

In [3] and later papers, memory slots are added at each node to get 

a;(fc-|-I) = (I — wiL)x{k) + W 2 {x{k— I) — a;(A:)) (2) 


with some fixed wi,W 2 € K- Extending the memory registers does not improve 
convergence speed if only bounds A < A on the eigenvalues of a fixed L are 
known [7]. 

Another approach is polynomial filtering, where a time-varying choice of P^ 
accelerates convergence UM- The idea is that if L has eigenvectors xi with 
eigenvalues Xi, then after t steps of Pw{k) = I — w{k)L, each eigenvector has 
been multiplied by pt{Xi) = — w{k)Xi). Through choosing a set of w{k), 

Pt can be any polynomial of order t satisfying pt(0) = 1; whereas constant w 
restricts to Pt{Xi) = (1 — wXiY. Choosing w(k) = l/A^ would imply finite-time 
convergence. The latter requires not only to implement a high-order polynomial 
filter (see Section 4.1), but also to know the eigenvalues exactly. In the opposite 
case where we only know that the eigenvalues lie in a bounded interval, we must 
apply the following result. 


Proposition 1. If the eigenvalues of L span the whole interval Xi € [A, A], 
then the optimal memory slot dynamics (H is at least as fast as 0 with any 
time-varying w (i.e. any order of polynomial filtering). 


Proof. [S] gives the optimal polynomial filter and its worst-case long-term con¬ 
vergence speed: for w such that p, = (1-wA) = — (l-icA) we have maxp_.^,>,|<^[pt(A)] 

^Boyd et al. study the optimization for Markov chains, with the constraint [P,n]i,j > 0 
for all i,j. For consensus that constraint also favors robustness to network changes. In the 
present paper we drop it, for a fairer comparison with other algorithms; the optimization 
remains a convex problem. 
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1/ |T((1///)| with pt rescaled from a t-order Chebyshev polynomial Tt. A func¬ 
tion plot readily shows that 


1 1 n 
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|rt(l//r)|^/‘ M V 

for all p G (0,1), with equality as t —)■ +oo. The right side of (§ is precisely 
the worst-case convergence speed with optimal (D, see HIT]. □ 

Whether a synchronous variation of ic or a local memory slot is a more 
demanding resource, is application dependent. One local memory allows to 
evaluate Tt with a stable recurrence [4]. Our new observation with Proposition 
[^is that with respect to this implementation, ([^ is superior. 

However, in the following, we show how much more can be gained by the 
polynomial filter when (somewhat) more is known about L. We restrict our 
investigation to a two-step alternating scheme: 

x{k -I- 2) = {I-W 2 L){I-wiL) x(k) , (4) 

performing second-order polynomial filtering on the L-spectrum (or equivalently 
on the P-spectrum). 

Note that acceleration schemes have also been proposed for Markov chains 
(see e.g. and related work). However, the most prominent schemes explicitly 
build on full knowledge of a given Q, which differs from our setting. 

The remainder of the paper is organized as follows. In Sectionj^we compute 
the optimal second-order polynomial and the associated gain in convergence 
speed. We also mention some special graphs that allow 2-step consensus with 
possible symmetry-breaking. In Section we consider the optimization of graph 
weights towards polynomial filtering. Section discusses the implementation of 
the acceleration scheme and its robustness properties. 

2 Optimizing the polynomial filter 

We now investigate the form of the optimal second-order polynomial filter Q 
for a given graph Q, about which we possibly know more than just the spectral 
gap. Before the rescaling of L in Q, we can assume without loss of generality 
that P := I — L is centered, i.e. —fj,{P)x'^x < x^’^Px < p,{P)x"'"x for all x with 
EiLi — 0, with each equality achieved for some a:, and where p{P) is the 
STEM of P. In this situation, the optimal polynomial filter only depends on 
P’s STEM and Smallest Eigenvalue Modulus cr{P) (SEM). 

Explicitly, rewrite Q as 

x{k + 2) = ^ —— ~x{k) =:p 2 {P)x{k) (5) 

(I - Zi)[l - Z 2 ) 
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and denote by {Xi : i = 1,2,..., N} the eigenvalues of P, with Ai = 1 correspond¬ 
ing to the consensus eigenspace. Note that the filter p 2 can be any second-order 
polynomial restricted to p 2 (l) = 1- We define the optimal exponential conver¬ 
gence rate as: 

/r 2 (-P) = min max|p 2 (Aj)| . 

Zl,Z2 i>l 


Theorem 1. Consider a connected, undirected graph Q with given (centered) 
weight matrix P. The optimal convergence rate attainable through p 2 -acceleration 
(1^ is given by 

p{Pf-a{Pf 

^ 2 -/i(P)2 - a(P)2 

and obtained with the unique polynomial 

P"^ - p{Pf (2 - a{Pf 12 


P2{P) = 


1- p.{PY/2- a{PY-/2 


(7) 


Proof. For any [|a|, |&|] C [0,1], the second-order polynomial P 2 ix) for which 
P 2 { 1 ) = 1 and which minimizes |P 2 (a;)| is determined by P 2 (&) = 

P 2 {—b) = —p 2 (a) = —p 2 (—a)- Replacing |a|,| 6 | by particular eigenvalues 
a{P), p{P), this polynomial bounds the best possible performance. Moreover, 
for this same polynomial we have |_P 2 (a^)| < |P 2 (^)| for any |a:| G [a,b], so it 
actually gives the best performance, independent of the other eigenvalues of 
P. □ 


The convergence rate P 2 {P) must be compared with p{P)^, the convergence 
over two steps of the standard consensus algorithm 0 . The resulting improve¬ 
ment by p 2 -acceleration is illustrated on Figure 


2.1 Favorable graphs and symmetry breaking 

In light of the previous result, a given weighted graph Q will allow substantial 
acceleration through second-order filtering if its (e.g. Laplacian) eigenvalues are 
clustered in two sets whose width is small compared to the distance between 
the sets. This situation easily generalizes to the case of n clusters for n-order 
polynomial filters, at least conceptually; explicit forms generalizing 0,0 may 
be more difficult to obtain. 

This observation motivates a different way of optimizing network links, if 
some design freedom is available. Before turning to an investigation of such 
optimization, we briefly consider the special cases where finite-time convergence 
to consensus is achieved in two steps. 

The study of finite-time consensus has its own line of work, to which the 
present results connect through HIH] and the like. Obviously, if a P matrix in 
our framework has only two distinct eigenvalues A 2 = —A 3 besides the invariant 
space with Ai = 1, then ct(P) = p{P) and 0,0 imply perfect consensus after 
one application of Q. Graphs Q for which Pi, P 2 can be selected such that 
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Figure 1: Comparison of SLEM after 2 steps of standard consensus (fi^) and con¬ 
vergence rate nnder optimal p 2 -acceleration for some arbitrary P. The critical 

eigenvalnes, determining the convergence rate, are distinguished (x) from the other 
ones (o). The polynomials i/ = (dotted lines, standard consensus) and y = P 2 (A) 
(plain line, polynomial filter) illnstrate the acceleration mechanism. 


Pi P 2 features deadbeat convergence are characterized in [5] . The polynomial 
filter is restricted to P 2 = cl + dPi, with c,d G M.. It is presently unclear how 
much more restrictive this is, but the following results nevertheless carry over 
from [6]. 

Proposition 2. • If 2-step convergence can be reached on a graph Q then its 
diameter d{Q) < 2. 

• d{Q) <2 is not sufficient for 2-step convergence. 

• If in addition Q is distance-regular, then 2-step convergence can be attained. 

The condition d{Q) < 2 turns out to be sufficient for iV < 5 at least, showing 
that 5 out of the 6 four-node graphs (resp. 15 ouf of 21 five-node graphs) converge 
in two steps with p 2 -acceleration; while for ([^ only the complete graph converges 
in finite time. Lists of graphs with only two different nonzero eigenvalues can 
be found in the literature, see e.g. [HI Table 14.2, Table 14.4, Chapter 15.2] 
assuming uniform weights. 

Examining the complete bipartite graphs K (£, m) between sets of i and m 
nodes allows to illustrate interesting finite-time convergence properties. Both 
/C(m, m) and the star graph /C(l, m) allow finite-time convergence using p 2 with 
uniform link weights, although only ]C{m,m) is distance-regular. For the other 
IC{£, m) cases, uniform weights lead to 3 distinct nonzero eigenvalues. Regarding 
(FSSC) this choice is optimal, see [HI[TO]- Regarding p 2 , a nonuniform weight 
selection might further accelerate the convergence. In particular, a symmetry 
breaking on lC(2,m) does allow finite-time convergence. Indeed, with asym- 
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metric weights p ^ q (see figure , the nonzero eigenvalues are proportional 
to: 

I (m+l)(p+q) (m+l)2(p+5)2_4pqm(m+2) 

p -t q, 2 ^ 2 

By choosing q = ^ (m ± y/m? — 4)p, this set reduces to two distinct values and 
P 2 {P) = 0. This possible benefit of symmetry breaking contrasts with standard 
consensus Q, for which m shows that keeping the edge-transitivity symmetry 
in the weights does lead to the FSSC. 



Optimization of the edge weights towards p 2 -acceleration, for a given graph 
structure, is further considered in the following section. 


3 Preconditioning graphs: spectral clustering 

Theorem [2 describes the convergence rate obtained with an optimal polynomial 
filter for a given weighted graph (spectrum). This section treats the optimization 
of the edge weights in compliance with the graph connectivity, which we call 
preconditioning the graph towards p 2 -acceleration. This is in the same spirit 
as the FSSC. However, optimal polynomial acceleration favors an eigenvalue 
spectrum in distinct clusters. A positive point is that Theorem gives an 
explicit expression for the optimal polynomial, so we can efficiently concentrate 
on optimizing P. For a given ^(V, £^), we denote by P* the FSSC i.e. the optimal 
choice of weights for Q , by P *2 the optimal weights for p 2 acceleration. 

3.1 Bound on fastest convergence rate 

We first get a bound on the possible acceleration. Given Q, denote its square 
graph, in which two nodes are linked by an edge iff in G they are linked by a 
path of < 2 edges. We denote an arbitrary weight matrix on this power graph 
by Pg 2 . 

Proposition 3. The fastest convergence rate attainable on G using p 2 -acceleration 
is bounded by the STEM of the Fastest Single-Step Consensus on G^, i.e. 

P2{p;j > p{P*g2) 
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Proof. For P 2 {P) we effectively tune the same edge weights as for Pg 2 , except 
that in the latter the individual weights are unconstrained while in the former 
they are interdependent through the polynomial structure. □ 

This bound is not always tight: if the diameter d{Q) < 2 then is com¬ 
pletely connected and p{Pq 2 ) = 0. The second item of Propositionhowever 
states that this does not imply p 2 {Pp 2 ) = 0. 


3.2 Numerical optimization 

The FSSC choice of P* does not necessarily minimize p 2 , see e.g. the IC{2,m)- 
graph (section 2.1); an alternative optimization is required. 

Expressed as a function of the individual edge weights, P 2 {P) as defined in The- 
oremj^is not a convex function, unlike /r(P) for the FSSC. Given the convexity 
of ^(•) as a function of its argument and inspired by Proposition]^ we try to 
reformulate the problem as minimizing fj,{Pg 2 ) over all Pg 2 which can be repre¬ 
sented as p 2 {P). The resulting relaxation however is not expected to improve 
convexity as the set of all P 2 {P) is a non-convex subset of the set of all Pg 2 (see 
Proposition in appendix). So to have a convex problem we indeed need to 
relax the subspace to all Pg 2 . The solution of the relaxed problem would then 
have to be reprojected into the set of all p 2 {P). And for this, it is well-known 
that variations on matrix elements give little insight on the induced variations 
in eigenvalues. 

Thus optimal preconditioning towards p 2 "a'Cceleration seems significantly 
more difficult numerically than FSSC, unless a better reformulation is found. In 
the meantime, we have performed a numerical optimization by gradient descent, 
minimizing the explicit function pL 2 as a function of the edge weights. We claim 
by no means that this is the best strategy in terms of complexity or results, at 
this point it is just a feasible one to evaluate the potential of preconditioning for 
P 2 "acceleration. As the used formulation is non-convex, the method converges to 
a local minimum, not guaranteeing the global optimum. The bound p{Q^) from 
Proposition can give an indication on the quality of the obtained P matrix. 

Figure [Sjshows the spectrum of an Erdos-Renyi graph with 20 nodes, 96 
edges and diameter 2. Its weights have been optimized respectively for FSSC 
(P*) and for acceleration preconditioning {P* 2 , hopefully). The difference is 
graphically striking as the preconditioning brings fi 2 {P*) = 0.0626 down to 
P 2 {Pp 2 ) = 0.0088. This is further to be compared to p?{P*) = 0.1181 for the 
FSSC without p 2 acceleration and to the lower bound ^{Pgi) = 0. 

Beneficial clustering of the eigenvalues around two polynomial zeros ±z as on 
Fig .|^is not always possible. It typically deteriorates with decreasing number of 
edges per node, thus as the amount of degrees of freedom available for optimizing 
a constant number of eigenvalues decreases. That this is not a general rule 
follows from the perfect clustering of star graphs (section [2T] ). We observe the 
following behavior in simulations. 


• The behavior of the preconditioner and acceleration was examined on a 
large set of Erdos-Renyi graphs. A clear trend appears when examining 
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Figure 3: Spectral clustering performed on an Erdos-Renyi graph C/(20,9). The plot 
shows the spectrum of both the FSSC P* and the optimized Pp 2 , as well as their 
optimal p 2 -polynomials and corresponding /i 2 values. 


the average results as a function of the density, i.e. the number of edges 
compared to the complete graph, as shown in figure When less than 
about 30% of the edges are present w.r.t. the complete graph, the most 
significant acceleration is obtained by just taking the graph optimized for 
the FSSC, which is easily computable, and applying the optimal polyno¬ 
mial hlter to it instead of first-order consensus; trials to further adjust the 
weights towards faster convergence do not really pay off (in average). For 
densities higher than about 30 %, the preconditioner starts to signihcantly 
pay off, with gains up to orders of magnitude. 

• This behavior and the previously mentioned lower bound of Proposition]^ 
might suggest that a sort of phase transition should appear as a function 
of the diameter of the graph. To investigate this we have partitioned 
the set of Erdds-Renyi graphs of a given density, as a function of their 
diameter. Except for the graphs with finite-time convergence at diameter 
2 , the acceleration ratio achieved by the preconditioner with respect to the 
FSSC appears to have the same distribution on these subsets. Moreover, 
the lower bound of Proposition]^ was reached on several graphs of diameter 
3,4 and 5. This seems to indicate that the graph diameter is not a limiting 
factor for the preconditioner. Surprisingly, it appears that the role of the 
graph diameter in possible convergence speed is an open question in the 
literature also for the FSSC/FMMC problem. 

• The optimal P matrix sometimes contains negative elements, also on off- 
diagonal entries. This indicates that repulsion between certain nodes can 
accelerate consensus with p 2 ■ Such negative entries however can be unde¬ 
sirable for robustness, in which case one can easily exclude them in the 
optimization process. 

• Unsurprisingly, a spectrum well clustered with the preconditioner is often 
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density 

Figure 4: Performance of clustering on an Erdos-Renyi graph with increasing density. 
Depicted are p(P*)^ using the FSSC during two steps, its p2-acceleration /i2(P*), 
the p2-acceleration with optimized graph p2(Pp2) (hopefully), and the lower bound 

KPs^)- 

highly degenerate in ±/r and ±cr. This is similar to the degeneracy found 
at ±p, in the spectrum of the FSSC. 


4 Robustness 


4.1 Implementation in alternating steps 

Implementation of the acceleration scheme Q requires to sequentially apply 
two matrices P_ and Pp, such that 


P-P+ = 


P^-z^ 

1-Z2 ’ 


with 


MP)^ + a(P)2 
2 


A linear implementation 0 compatible with one-step communication links for 
fixed P, necessarily takes the form 


P_ = a 


P-z 
1 - z ’ 


P+ 


I P + z 
a 1 + z ’ 


a e K. (8) 


The effect of P_ and P+ on a given eigenvector is obtained just by replacing P in 
^ by the corresponding eigenvalue. We immediately see that the eigenvalue 1 
of P, corresponding to the consensus eigenvector, will be multiplied alternatively 
by a and by 1/a. In this sense a>l(orl/a>l) thus implies an “unstable” step 
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with P_ (or with P+) for the consensus eigenvector. The consensus value is kept 
at each step only if a = 1, else it is recovered every second step. However, taking 
a 1 can be interesting regarding the remaining eigenvalues of P. We say that 
the iteration towards consensus involves stable steps if both ia{P+) < 1 and 
p{P-) < 1, i.e. they are stable on the eigenspace orthogonal to the consensus 
eigenvector associated to the trivial eigenvalue 1 of P. 

Proposition 4. For given z: • The matrices P+ and P_ with a = 1 are both 
stable for consensus if and only if p,{P) < 1 — 2z. 

• The iteration towards consensus involves stable steps for a proper choice of 
a 1 if and only if p{P) < 1 — z. 

For given pL{P) it is always possible to restrict z such that these stability con¬ 
ditions are satisfied, possibly by taking it smaller than the optimal acceleration 
value z^ = (/r(P)^ + a{P)'^)l2. 

Proof. Choosing a = 1 we get, by linearity of both P+ and P_ in P and hence 
in its eigenvalues: max{|^(P+)|, |^(P_)|} = (^(P) + z)/{l — z). The latter is 
smaller than 1 if and only if p{P) < 1 — 2z; while the eigenvalue 1 of P remains 
unchanged in P+ and P_ when a = 1. Choosing a = \/l — zj^/l + z minimizes 
max{|^(P+)|, |/r(P_)|}, at the value (^(P) + z)l\/l — The latter is smaller 
than 1 if and only if ^(P) <1 — z. □ 

Figureillustrates the second case of Proposition]^ The stability of indi¬ 
vidual steps can be relevant if we cannot ensure that all intended steps will be 
applied, e.g. due to synchronization issues. For standard consensus with P this 
implies no problem, just skipped steps. But in accelerated consensus with a = 1, 
if by chance P_ is applied more frequently than P_|_ in the situation of Fig. 
then one mode increases in an unstable way. We can prevent this risk of insta¬ 
bility with Pff, which takes a 1. In this case, more frequent applications of 
P_ will just change the consensus value to something (unstably) different from 
the average of initial values, but it will not prevent the agents from converging 
to consensus. The preferable tradeoff depends on the practical situation. 



j_^____^_L 

1 2 3 4 5 6 7 


step t 

Figure 5: Convergence of the individual steps when using standard consensus (P), 
optimal p 2 acceleration with a — I (periodic repetition of P+P_ ...) and with a = 
VI — zjyjl F z (periodic repetition of ^ for the graph with optimized weights 

shown in figure]^ 
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4.2 Robustness to link failure 


Consider a consensus scheme tailored to an initial undirected graph Q with 
weight matrix P, featuring positive weight on each link. A sudden edge failure 
leads to a modified graph Q' with weights P'. The standard consensus dynamics 
will remain stable under this failure, as P' remains doubly-stochastic, see e.g. 
Link failure affects the p 2 -accelerated dynamics as follows. 


Proposition 5. • For certain graphs, the optimal p 2 filter for a weight matrix 
P associated to positive edge weights can become unstable with weights P' in 
which a link has failed (permanent link failure), and also if the link fails one 
step out of two (resonant link failure). 

• Restricting z < 1(^/2 (resp. z < {1 — p{P))/2) in the polynomial filter ensures 
stability under permanent (resp. resonant) link failures from a P associated to 
positive edge weights, as for standard consensus. 

• Any P matrix which features some negative edge weights can become unstable 
under specific link failures, both for standard consensus and with p 2 acceleration. 

• Robustness to link failure is independent of the choice of a in §. 


Proof. The last point results from the fact that even if links fail at some times, 
in absence of other casualties, the scheme keeps alternating the two steps of Q, 
possibly with different P matrices but still with the a factors canceling. 

The third point is trivial if we consider the case where all links with positive 
edge weights (attraction between agents) fail, while all links with negative edge 
weights (repulsion between agents) remain. Indeed, this leaves only repulsive 
dynamics and we can write P' = I + L' where L' is a Laplacian with non¬ 
negative eigenvalues - i.e. some eigenvalues of P' will necessarily be larger than 
1. Let us now turn to the case of P restricted to positive edge weights. 

Under permanent link failure, the polynomial filter is applied to a different set 
of eigenvalues S [—1,1], all closer to 1 than the original ones M- If b2(A)| < 1 
for all A G there is no risk of instability. However, if P2{0) < —1, 

an eigenvalue of P m^ht become close to zero after link failure and lead to 
instability, see Figure]^ This leads to the condition z < l/-\/2. 

Under resonant link failure, when corresponds to all links failing and P'_ to 
no link failure, the eigenvector of p{P) is multiplied by over two time 

steps. This is the worst case: just consider the worst graph achievable with link 


failures separately for each step. The condition expresses 


-u(P)-z 

1-2 


< 1 . 


□ 


The instability under link failure is caused by a potentially unstable region 
on the polynomial, characteristic of highly clustered spectra. Figure]^ shows a 
5-node graph whose clustered spectrum has an unstable region in the center, 
i.e. IP 2 I > 1 on some interval inside [—1,1]. Indeed, when any of the dashed 
edges fails, one of the eigenvalues hops into this region. For both permanent 
and resonant failure the scheme will turn unstable. Constraining the polynomial 
according to Proposition restores robustness of the scheme but lowers its 
acceleration. 


II 







Figure 6: The graph on 5 nodes defined by the union of full and dashed edges on the 
left, is preconditioned for p 2 acceleration with weights (1,2), (2,4) = 0.628, (1,4) = 
0.605, (1, 3), (3,4) = 0.045, (3, 5) = 0.926. Its (highly clustered) spectrum and optimal 
P 2 are shown on the right. Failure of any of the dashed edges leads to instability as 
an eigenvalue moves into the unstable region of P 2 , as indicated by the arrow on the 
spectrum. 


5 Discussion 

5.1 Higher Order Polynomial Filtering 

While this paper focuses on second-order acceleration, several results can be 
extended to arbitrary order polynomials pm while others remain open. 

• The closed form for the optimal polynomial — if there actually is a unique 
one — still needs to be investigated. It will certainly exploit more graph 
information than fi{P) and cr{P). 

• Graphs allowing finite-time convergence can be investigated, with obvious 
improvement as M tends to N, the graph order. Note that still for p 2 , 
we do not have an exact answer to the question, among others due to the 
necessity to consider beneficial symmetry-breakings. 

• The bound of Proposition [sl holds verbatim in the form pM^Pp ) > 

• The non-convexity of Proposition holds, replacing each edge in the ex¬ 
ample of the proof by a path of appropriate length. 

• For the gradient descent and associated investigation, a closed form for 
the optimal polynomial would be welcome, else the polynomial parameters 
can be part of the optimization variables (see e.g. 0). 

• The robustness discussion remains qualitatively the same. 

However, relevant situations for practical implementation of consensus with 
high-order polynomial filters would probably be the first question to consider. 

5.2 Conclusions 

In this paper we have characterized the possibilities to accelerate linear consen¬ 
sus by second-order polynomial filtering as proposed in [5]. We have observed 
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that this strategy is beaten by an acceleration based on local memory slots if 
only an upper and a lower bound are known on the graph spectrum. However 
when more is known about the graph spectrum, performance can be improved 
significantly. For a graph with fixed weights the optimal filter and its conver¬ 
gence rate were derived exactly. A preconditioner is proposed which optimizes 
the edge weights of a given graph, clustering its eigenvalues towards better poly¬ 
nomial acceleration. Unlike for standard consensus this optimization appears to 
be non-convex. Significant payoffs are obtained especially for graphs with high 
edge density. 

A few academic questions remain open. A particular one is whether the 
possibility of achieving consensus in k steps with a time-varying weight matrix 
on a fixed graph implies that a fc-order polynomial filter can also achieve 
finite-time consensus. A more general question, for which we have been surprised 
to find no answer in the literature even regarding standard consensus algorithms, 
is how the diameter of a graph might bound the best achievable convergence 
rate with optimized edge weights. 

In a broader scope, we notice that an approach similar to polynomial filter¬ 
ing has been proposed a few decades ago to control LTI systems using periodic 
memoryless output feedback m- They show that introducing periodically vary¬ 
ing feedback can widen the eigenvalue assignment possibilities. We anticipate 
that those accelerations based on additional memory or time-dependent actions 
could also be linked to the memory effects and parallel actions present in quan¬ 
tum random walks [12) . We are currently working on formalizing this link in 
the emerging field of quantum systems engineering. 

Proposition 6. The set of all P 2 {P) is a non-convex subset of the set of all 

Pg 2 . 

Proof. Consider /C(l,4), the star graph on 5 nodes (see fig. [^. Let a given P' 
have equal positive weights only on the edges (1,2) and (1,3), and P” have the 
same positive weights only on (1,4) and (1,5). Then for P 2 ,P 2 some second-order 
polynomials, p' 2 (P') and p'^iP") correspond to positive weights on respectively 
(1,2),(1,3),(2,3) and (1,4),(1,5),(4,5). Their convex combination l/2p2(^") + 
l/2p'2{P') cannot be generated by any p 2 (P), because (strong) weights on (2,3), 
(4,5) require positive weights on all 4 edges of the star, which in turn unavoidably 
imply (non-negligible) positive weights on (2,4),(2,5),(3,4),(3,5) in any second- 

order p 2 (^’)- □ 


5 • 5 • 

\ 



4 • 4 • 


Figure 7: Left: P', dashed line added for P 2 {P'). 
Right: Convex combination. 




Center: idem for P" and p'fiP"). 
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